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ABSTRACT 

We study in detail the photometric redshift requirements needed for tomographic 
weak gravitational lensing in order to measure accurately the Dark Energy equation 
of state. In particular, we examine how ground-based photometry (u,g,r,i,z,y) can be 
complemented by space-based near-infrared (IR) photometry (J,H), e.g. on board the 
planned DUNE satellite. Using realistic photometric redshift simulations and an ar- 
tificial neural network photo-z method we evaluate the Figure of Merit for the Dark 
Energy parameters {wo^Wa). We consider a DUNE-like broad optical filter supple- 
mented with ground-based multi-band optical data from surveys like the Dark Energy 
Survey, Pan-STARRS and LSST. We show that the Dark Energy Figure of Merit 
would be improved by a factor of 1.3 to 1.7 if IR filters are added on board DUNE. 
Furthermore we show that with IR data catastrophic photo-z outliers can be removed 
effectively. There is an interplay between the choice of filters, the magnitude limits 
and the removal of outliers. We draw attention to the dependence of the results on the 
galaxy formation scenarios encoded into the mock galaxies, e.g the galaxy reddening. 
For example, very deep u band data could be as effective as the IR. We also find that 
about 10^ — 10^ spectroscopic redshifts are needed for calibration of the full survey. 
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1 INTRODUCTION. 

Measuring the nature of Dark Energy has become a central 
part of current studies in cosmology. A variety of methods 
have the potential to probe Dark Energy through its ef- 
fects on both structure g rowth and geometry of the Universe 
(iTurner W hite "1997"; 'Riess et al 'l998; "Perlmutte r et al. 
19991: IHuII1999: Blake k Glazebrook 2003: Seo k Eisenstein 



2003 



Hu Haiman .20031 : IBlake et al.1 120041 ). Future cos- 
mic shear measurements are now widely believed to have 
the greatest potential for con straining the Dark Energy 
equation of state parameter (|Peacock Schneider! l2006l : 
lAlbrecht et a l. 2006). However, to reach this potential fu- 
ture surveys will require tight control over systematic and 
statistical errors. 

Weak- lensing surveys will have to overcome system- 
atic checks such as a reliable point spread function 
(PSF) removal, w ell calibrated shape estimation (e.g. 
iMassev et al. _ 2006lV rernoval o f intrinsic galaxy alignments 
fe^ iKing Schneiderl l200l l2003l : IHcv mans k HeavensI 
2003 ) , shear- shape alignrnent removal (e.g. iHirata Seliak 
2OO4I : iHevmans et"alll2006l : iBridle Abdallall2007f ) and pho- 



tometric redshifts bias corrections, all of which are vital for 
the statistical limits of the test to be achieved. 

A reliable removal of PSF effects can be greatly im- 
proved by using satell ites such as th e Dark Universe 
Explorer (DUNE, Refregier et al. 200 Q) or the Super- 
nova/Acceleration Probe (SNAP, Alde ring et al.1 l2004l l. 
which will take advantage of the image quality achievable 
in space to accurately measure the shapes of lensed galax- 
ies. However, obtaining the multi-band photometry required 
to measure photometric redshifts from space would require 
a substantial mission and may not be necessary. At certain 
wavelengths, obtaining photometry from the ground is much 
faster. Given that large ground based facilities are available 
or being planned the case for obtaining photometric red- 
shifts from the ground and shape measurements (and some 
photometry) form space is compelling. 

In this paper we analyse future weak lensing projects 
with space based imaging for shape measurments plus 
ground based multi-colour photometry. We assess the im- 
pact of the photometric redshift accuracy on the science 
goals of such a mission. We further assess whether the sci- 
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Figure 1. The distributions characterising our simulations. The number density of objects as a function of redshift for the full simulation, 
which has a magnitude limit of 27, as well as the cut simulations with a magnitude limit in the RIZ band of 25. We also plot the relative 
number of galaxy types in the simulation with templates numbered from to 50 according to Sec|2] as well as the amount of reddening 
(Av, the extinction in magnitudes in V band) applied to templates in the simulation. 



ence would benefit significantly by having some photometry 
from space, especially very red optical and infrared bands 
which are difficult to obtain from the ground due to the sky 
brightness at these wavelenghts. 

We begin by describing the catalogue generation upon 
which we base our findings. We then make a full photomet- 
ric redshift analysis with artificial neural networks to assess 
the accuracy of the photo-z obtained in each of the scenar- 
ios considered. We explain the different degeneracies found, 
their origin and how the different survey bands and depths 
help in breaking those degeneracies. Finally we translate our 
photometric redshift errors into a figure of merit for the sci- 
ence considered, in this case for the Dark Energy equation 
of state derived via weak gravitational lensing tomography. 
All magnitudes stated in this paper are AB magnitudes. 



2 CATALOGUE GENERATION. 

In order to simulate the catalogues used in t his work, we 
used t he G QODS-N spec t roscop ic sample from lCowie et al.l 
(|2004l l and IWirth et all (|2004l ). This data set includes a 



R < 24.5 magnitude limited sample along with x-ray, ra- 
dio, and colour selected objects. The majority of objects 
are at z < 1.4. However, a specific effort was made to in- 
clude objects at 1.4 < z < 4. Using these redshifts, we 
have generated a series of galaxy templates from the broad- 
band photomet ry, (U,B,y , R,I,Z, J,H,K,HK') using a method 
similar to Budavari et al. The key difference is that 

we assume a prior set of templates (Cole man et al.l (|1980h 
(CWW) + iKinnev eraP ([1996, ) + intermediate types). We 
assign the following types to templates: type is Elliptical, 
type 10 is Sbc, type 20 is Scd, type 30 is Irr, type 40 is SB3 
and type 50 is SB2. Intermediate types are a linear interpo- 
lation of these types. For instance, type 5 is 0.5E1 + 0.5Sbc. 
We, therefore, can remove reddening from the photometry 
before constructing the templates. Once we have a set of 
templates, the best fit SED and reddening value for each 
object is found. We use the Calzetti reddening law (Calzettil 
ll997l V In addition to reddening, we apply a correction fo r 
intergalactic absorption using the Madau law ( Madaulll995l l. 

We use the model described below for the luminosity 
function evolution in the simulation to estimate the RIZ 
magnitude and redshift distribution. The RIZ filter is as- 
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Table 1. The assumed surveys that we investigate in this work. (Top table) The values quoted are 10 sigma magnitudes for extended 
sources in the AB system. We have taken assumed depths for proposed ground based future imaging surveys and a possible IR survey 
from space. We also simulate a current ongoing survey over a much smaller area to compare our results. (Bottom table) Signal-to-noise 
ratios for a 25.0 RIZ Sbc galaxy for each of the surveys/filters. 



sumed to be a broad filter covering roughly the range 5500 
A to 10000 A. The loc al r band lum inosity function at z = 
is taken as well as the ISteidel et al. 1 ([1999 ) luminosity func- 
tion at z = 3. We linearly interpolate between them in 
redshift space. At 2; > 3 we assume L* and the faint end 
slope are constant, but the volume density which decreases 
to IQ-^/i^Mpc-^ at z = 10. We run a Monte- Carlo simula- 
tion which draws the RIZ magnitude from this distribution 
and the reddening and spectral types from the GOODS-N 
(|Cowie et alJ l2004l ) distribution, which should close for a 
DUNE-like survey since the GOODS spectroscopic limit is 
R < 24.5. We note here that the mock is an extrapolation 
for R > 24.5 because GOODS is not complete for magni- 
tudes fainter than this level. We argue that this is not a 
large effect because the DUNE magnitudes that we consider 
are not much fainter than these magnitudes, hence there will 
only be a small extrapolation for galaxies between R > 24.5 
and RIZ < 24.5. 

We then calculate fluxes for the galaxy based on the 
redshift, SED type, reddening and filter profiles, normalising 
to the RIZ magnitude sampled. Finally, we add Gaussian 
noise to the fluxes, then estimate the magnitudes and errors 
from the fluxes with noise in the same way as a photometry 
package would. We plot in Fig [T] the statistical properties of 
the catalogue, i.e. the galaxy distribution as a function of 
redshift, reddening and type. 

The final catalogue contains galaxies which are com- 
plete in the simulation out to a magnitude limit of 27.0 in 



RIZ. However given the surveys we are to simulate we cut 
the catalogue depending on the photometry available. One 
of the purposes of this study is to assess the impact that 
space based IR photometry will have on the estimate of Dark 
Energy parameters. For this purpose we study a fiducial sur- 
vey which can be achieved by a satellite with a 1.2m mirror. 
With a 1500s exposure this would reach depths in the IR 
of around 23.0 and would reach a depth in optical bands 
of around 25.0 in AB. These depths would be feasible with 
a mission such as DUNE (see Table[T|). However, this does 
not restrict our study as it is applicable to any other wide 
field imager that would provide similar data although this 
is a difficult task from the ground. Hence, for an analysis of 
a DUNE like survey we obtain a cut catalogue which has a 
lOcr detection in the RIZ filter. Unless we state otherwise we 
cut the catalogue at an RIZ magnitude of 25. Correspond- 
ing photometry is available for other filters obtained from 
the ground which may be more or less noisy than the RIZ 
detection. 

3 PHOTOMETRIC REDSHIFTS 

3.1 Estimating photo-z via neural networks 

Essentially there are two approaches to obtain reliable pho- 
tometric redshift s. Template methods compare the colours 
found with the photometric data for each galaxy with the 
colours that templates would predict were these templates 
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Figure 2. Scatter plots of photometric redshifts as a function of the true redshifts for some of the surveys considered in Sec j3.2l We have 
shown the galaxies which have photometric and spectroscopic redshifts below 3.5. We have considered optical surveys with increasing 
depth. The shallowest is a survey with depth similar to DES, followed by a hypothetical survey with depth similar to Pan-STARRS, and 
finally LSST. We also consider two hypothetical optical surveys with very deep exposures in optical bands, especially in the very red 
bands, one of which has a very deep u band exposure. We assess in this figure how the inclusion of deep IR data obtained from space 
would enhance the photometric redshift estimation. 
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Figure 3. Density maps in the Zspec — Zphot plane. An alternate representation of the data shown in Fig|2] The figures are colour coded 
according to the local density of points. The colour scheme is exponential; this means that a colour difference which is different by one 
unit according to the scale means that the density is a factor of e ~ 2.718 smaller. We have shown the galaxies which have photometric 
and spectroscopic redshifts below 3.5 but the neural network fit has been done over the entire range of galaxies available. 
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placed at different redshifts. Training methods attempt to 
map out a function which would translate magnitudes to a 
single photometric redshi ft. A more d etailed description of 
methods can be found in Csabai et al ] 12003) and references 
within. 

It is well studied and accepted that template methods 
are more versatile and can be applicable when no spectro- 
scopic data is available. However, methods that use training 
set methods are more reliable and produce better photomet- 
ric redshifts. 

In this work we use artificial neural networks (ANNz), 
a training set method which has been shown to produce 
competiti ve results compared to ot her training set methods 
available (|Collister Lahavl l2004h . ANNz is a supervised 
neural network training tool. It requires a training set which 
is the data used to optimise the cost function 

E = ^(2;phot(c, irifc) - 2;train,fc)^, (1) 

k 

with respect to the free parameters ('weights'), c, where the 
sum is over the galaxies in the training set which determines 
the goodness of fit of the training set and irifc are the magni- 
tudes of each galaxy . If the data is noisy, a validation set is 
also required in order to prevent over-fitting. This is another 
portion of the data which also has spectroscopic information 
available but that is not included in the training process. It 
is solely used to provide the error function to be minimised. 

The remaining freedom left in a neural network analysis 
is the architecture of the network. A simple architecture is 
easier to minimise but may not provide the best fit to the 
data. On the other hand a complicated architecture may re- 
main stuck in a local minimum of the cost function more 
easily and hence not provide the best solution to the prob- 
lem either. We do not attempt to optimise the architectures 
for each scenario as we consider that this work does not have 
as an aim to judge the performance of different photometric 
redshift techniques. A network with architecture N:2N:2N:1 
(i.e. which has N inputs, two hidden layers with 2N nodes 
each and only one output estimating the redshift, and where 
only adjacent layers are interconnected) has been shown 
to work well on photometric data (iCollister k, Lahavl |2004| ) 
where N is the number of different photometric bands avail- 
able. Therefore we choose this architectur e in all our scenar- 
i os. Fo r details about the architecture see lCollister k, Lahavl 
liooi) and references therein. 

3.2 Impact of the photometry depth and bands 

In this section we compare the photometric redshift quality 
we get for different choices of survey depths and different 
choices of filters for the simulations we have presented in 
Sec|2] The aim of this section is mainly to assess the impact 
of the u band, IR bands and the redder optical bands (z and 
y) on the output of a photometric redshift code. 

We have chosen five optical baseline surveys. One which 
has a depth equivalent to what the Dark Energy Survey 
(DES) will be able to provide, roughly going down to a 
magnitude limit of 24 in the four bands (griz); another is an 
equivalent to what the collection of four Pan-STARRS tele- 
scopes will provide, potentially obtaining an order of magni- 
tude increase in depth in the five bands (grizy); the third is 
an estimate of what an optical survey with a Large Synaptic 
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Table 2. The cres (defined by the interval in which 68 per cent 
of the galaxies that have smallest Zphot — Zspec lie within) and 
a (defined by (^{zphot — Zspec)'^^) for different choices of depths 
and filters. A comparison of these two numbers assesses how many 
outliers there are in the sample and whether the distribution of 
photometric redshifts for a given spectroscopic redshift interval 
is Gaussian or has broad tails. We note that in order to obtain 
reliable photo-z for galaxies with an RIZ depth of 25, shallower 
surveys such as DES or Pan-STARRS are not well matched to 
DUNE, hence the scatter is large. We note that shallower sur- 
veys would not use such faint galaxies for lensing. Deeper optical 
surveys are necessary to reproduce good photo-z on a galaxy- 
by-galaxy basis, however IR data considerably improves even the 
shallower surveys. We also include one line based on photomet- 
ric redshifts from RIZ band only, as a baseline for comparison 
to other surveys and one line assuming an ideal survey with a 
shallower u band down to 25 so illustrate the increment accuracy 
that the u band gives from 24 to 26. 

Survey Telescope (LSST) would achieve, obtaining another 
order of magnitude increase in depth in 6 bands (ugrizy). 
We have also considered two ideal optical surveys, one with- 
out u band photometry but with a very deep exposure in 
the z and y bands; another which is very similar to the first 
but with very deep u band imaging. The depths of theses 
surveys are outlined in Table|T]and are chosen to be roughly 
consistent with the limits which the Dark Energy Survejo 
(which will survey the sky with the 4m Blanco telescope), 
Pan-STARRS 43 ' 

f which will be a collection of four 2m tele- 
scopes) and LSSTlfl (a project to survey the sky every night 
with a large 8m telescope) will be able to attain although 
the conclusions of this study are not dependent on the par- 
ticulars of these projects. 

^ http: / / decam.fnal.gov 

^ http:/ /pan-st arrs.ifa.hawaii.edu 

^ http://www.lsst.org 
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Figure 4. The rms sigma ((^(zphot — Zspec)'^^, bottom graphs) and the bias ((^{zphot — Zspec)^, top graphs) as a function of redshift for 
all the cases considered in this section (left without IR, right including IR). We can see how the inclusion of IR data improves significantly 
the data with optical exposure times but helps less data with significant optical exposure times although there is still improvement. We 
can also see the relative importance of bands by comparing surveys with deeper y and z bands and surveys with u band data. 



We measure photometric redshifts with these baseline 
surveys both including and excluding the additional infor- 
mation that a space based mission would add with infrared 
detectors. We plot our findings in Fig|2]and Fig|3]and show 
the scatter in redshift intervals in Table|2] and FigU The 
sigma is the r.m.s. photometric redshift error around the 
mean, and aes, the interval in which 68 per cent of the galax- 
ies have the smallest value of Zspec — ZpUot- We can see from 
the blue samples of Fig|2] how the increasing depth of the 
optical survey is significant in obtaining photometric red- 
shifts. We find that to obtain reliable photometric redshifts 
for a sample with an RIZ magnitude below 25 we are still 
improving significantly the quality of the photometric red- 
shifts if the overall photometry is as deep as 26 or larger. We 
note that in order to obtain reliable photo-z for galaxies with 
an RIZ depth of 25 shallower surveys such as DES or Pan- 
STARRS are not well matched to DUNE, deeper surveys are 
necessary to reproduce good photo-z on a galaxy- by- galaxy 
basis, however IR data considerably improves even the shal- 
lower surveys. 

We can also see, by analysing these scatter plots, the 
relative importance of different bands. We can assess, for 



instance, how much the deeper exposure times in the z and 
y bands would help in producing reliable photometric red- 
shifts compared to near infrared data; if we compare the 
LSST + IR case with the ideal case without u band we can 
see that out to redshift z ^ 1.6 — 1.7 the deeper z and y bands 
help and the improvement due to IR data is not as large as 
when the depths in y and z are shallower, however, for pho- 
tometric redshifts of galaxies above a redshift of z ~ 2.5 are 
only improved by the inclusion of IR data. We can under- 
stand this behaviour of the error in the photometric redshift 
estimate with the following argument: most of the informa- 
tion in photometric redshifts come from the 4000 A break 
in galaxy spectra, hence the best photometric redshift esti- 
mates are expected at the redshifts where the 4000 A break 
falls between bands with deep exposures. We can see that 
optical surveys have the best photometric redshift estimates 
around a redshift of z ^ 0.7 which corresponds to the 4000 
A break falling in the i band, hence having measurements 
of the spectrum on both sides of the break. Similarly, galax- 
ies at a higher redshift are helped by the measurement of 
the break redshifted into the IR bands. This argument can 
also be applied to the Lyman break which occurs at 912 A. 
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Figure 5. Cleaned catalogues for the 5 surveys considered before. All surveys contain IR information. As explained in Sec j3.3l we have 
removed all photometric redshifts with an error estimate larger than 0.3. After a neural network is trained it can assess the error on the 
photometric redshift looking at the error in in the photometry, this value is used for the cut presented here. We can see from all the 
scatter plots that although the error estimate from the neural networks is not the best available it is able to remove the correct galaxies 
and provide correct photo-z for some of the sample. After the cut, only 72% of the galaxies were left on the DES+ IR catalogue; 76% 
had good photo-z on the Pan-STARRS + IR catalogue; 82% of the sample on the LSST + IR catalogue; 90% and 95% of the sample 
on the Ideal + IR and Ideal + u + IR catalogues respectively. As we can see from Fig|6]a clipping of 0.3 is indeed conservative, we can 
clean the different catalogues at a higher error estimate but we make a comparison between catalogues here. We stress that this is not a 
sigma clipping, we do not use information about the spectroscopic redshift of the sample to make this cleaning procedure, we only use 
the photometry. The colour bar indicates the error estimate given by the neural network code. 



Photo-z from the ground and weak lensing from space 9 



DES (griz) 



DES (griz) + IR 



Pan-STARRS (grizy) 





-5 




spec phot 




Ideal (grizy) 



-5 

spec phot 

Ideal (grizy) + IR 




spec phot 





-5 



Pan-STARRRS (grizy) + IR 
5 




spec phot 



2 -Z , , 

spec phot 



LSST(ugrizy) + IR 




-5 



5 

sfec phot 

Ideal (ugrizy) 



Ideal (ugrizy) + IR 




2 -2 , ^ 



2 -z , , 



Figure 6. The relation between Zphot — Zspec as a function of the error predicted by the neural network Zerr based on the errors from 
photometry. We can see that the scatter points are not distributed in a Gaussian way around the centre for the shallower surveys and 
this becomes more apparent in the deeper surveys; hence the error estimate becomes more reliable the deeper the survey is. Furthermore, 
the error estimate is more reliable when IR data or u band data is obtained. We also note that in the specific case of neural network 
error estimates, with IR data the error is somewhat overestimated. These scatter plots show that in the case of DES + IR a cut at 0.3 
is appropriate to remove most outliers. 
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Figure 7. We present here scatter plots for some configurations before and after the cleaning. The top panels are in the case of DES 
photometry plus DUNE photometry, the middle panel is for LSST depth photometry without IR and the bottom panels are for the case 
of an ideal optical survey with very deep u band photometry. We can see that the cleaning is very efficient and removes outlier galaxies 
without removing good photo-z where as the same is not entirely true for the case of LSST photometry alone. For instance at some 
redshifts (around 3) some galaxies with good photo-z are removed where some outliers remain after the cleaning. The effect is much 
worse for poorer optical photometry than LSST for depths of 25 in the RIZ band. However we can see from the bottom panel that deep 
u band photometry can cure this. The conclusion is that cleaning can be an effective method to remove outliers but it is only as effective 
as the baseline of wavelengths available. Without deep IR or u band data this can lead to bad photo-z error determinations. 
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Figure 8. Fit to the unnormalised histograms of the quantity 
{z spec — Zphot) / z err • If the error estimator is rehable, the quantity 
above being one corresponds to the one sigma error Umit. There- 
fore 68% of the points should have {zspec — Zphot) I ^err smaller 
than one in absolute value and centred around zero. We plot this 
fit for six cases described in Sec j3.2l with and without IR data. 
We can see that for a cut in RIZ at magnitude 25, the shallower 
surveys produce data which leads to a biased error estimates with 
neural networks. All cases with IR data have a much better error 
estimates and therefore cleaning can be done more efficiently. We 
can see that the error estimates are less biased with the LSST case 
at this depth and are not biased at all if we have deeper u, z and 
y data. We also plot the curve exp{—x^/2) for comparison. If the 
photo-z error estimate is reliable, then the distributions shown 
here should be close to Gaussian. We can see that the DES mock 
without IR (blue curve) does not yield very reliable photo-z error 
estimates whereas the other configurations, including the DES 
mock with IR, produce more reliable results. 



This break is redshifted to the blue part of the spectrum for 
galaxies around redshift z ^ 2 — 2.5, hence u band data also 
helps photometric redshifts for galaxies in this region of the 
spectrum. 

The addition of u band data can significantly help the 
photometric redshift determination of certain galaxies. This 
information is helpful in removing catastrophic outliers that 
have a small spectroscopic redshifts and which have rela- 
tively featureless SEDs. We can see, comparing LSST with 
DES or Pan-STARRS, that there are indeed fewer outliers 
if we include u band imaging to our mock survey. However, 
comparing the LSST simulated results with the idealised 
survey including much deeper u band imaging we see that 
the u band depth has to be larger then 24.0 if we want all the 
catastrophic outliers to be removed for a RIZ survey below 
25.0; at a magnitude limit for the u band of 26.0 almost all 
the catastrophic outliers have been removed, we also provide 
numbers for a similar survey with u band as deep as 25 to 
illustrate the increment given by the u band in the range 
24.0 to 26.0. We note that these integrations on the u band 
are extremely difficult verging on the impossible even with 
the next generation surveys. We also note that the inclusion 
of u band data has a similar effect to the inclusion of IR data 
in the case of an idealised survey; the scatter including IR 
data is still significantly improved (see Tableland Fig|4|), 
but there are not many outUers with optical data only. 



Band 


Time 


Overhead 


Mapping 


Total 


u 


3.80 


1.00 


1 


3.80 


9 


1.00 


1.00 


1 


1.00 


r 


1.58 


1.01 


1 


1.60 


i 


3.02 


1.03 


1 


3.12 


z 


7.59 


1.10 


1 


8.32 


Y 


20.9 


1.02 


5.09 


106.9 


J 


19.1 


1.02 


5.09 


97.5 


H 


25.1 


1.45 


5.09 


182.7 


K 


27.5 


1.45 


5.09 


200.3 



Table 3. Time cost for obtaining data in different bands based 
on MEGAPRIME and WIRCAM on the Canada-France-Hawaii 
telescope (CFHT) relative to g band on MEGAPRIME. The time 
factor is the increase in integration time required to reach the 
same sensitivity as g band due to background and instrumental 
sensitivity. The overhead factor is the increase in overheadcom- 
paredto the g band taking into account the readout time of the 
instrument and maximum integration times due to the sky back- 
ground. The mapping factor is the increase in integration time 
required to map the same area due to the fact that IR instru- 
ments have a smaller field of view (due to the cost of the detec- 
tors). The Total is the relative expensive in total telescope time 
to reach the same depth as g band from the ground. This table 
illustrates how much time it is required to obtain IR data from 
the ground compared to optical data. 

We have also assessed in Table |2l the amount of outliers 
we get from a given survey configuration. We have calculated 
cr and (768 for each simulation. Whereas the standard devi- 
ation gives us the spread for the entire sample, cfqs does so 
only for 68% of the galaxies which have the best photomet- 
ric redshfits. Hence aes is insensitive to outliers whereas a 
is very sensitive to them. 

We note here, for example with LSST, that the u band 
requires 19 times as much integration time relative to other 
bands (specifically g) to reach the same depth. However blue 
optimised systems such as LET or CFHT can require as 
little as 4 times as much integration time in U to reach 
the same depth (depending on the exact filter choices) (see 
Tab|3] for similar factors for other bands based on CFHT 
and WIRCAM). For an optimised space based system us- 
ing UV sensitive CCDs this factor would drop to around 2. 
Specifically in the case of IR detectors; they are ten times 
more expensive (in term of cost) than CCDs to cover the 
same area. However in terms of integration time they reach 
the same depth in about the same amount of time. From 
the ground IR detectors are much less sensitive due to the 
higher backgrounds. 

3.3 Cleaned catalogues 

When a neural network is trained, we obtain an estimate 
for the error on each of the photometric redshifts predicted. 
This error is obtained the following way. For every scenario, 
the inputs of a neural network have an associated noise to 
them. We can assess the variance that this noise would intro- 
duce into the output of the network by changing the inputs 
according to the error. This will lead the the following error 
estimate: 

i 
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Figure 9. Density plots in the Zphot — Zspec plane for increasing depth of a catalogue. We have taken mock catalogues derived from the 
Cosmos survey and we made cuts in the filter F814W at several magnitude from 24.5 to 26.5 in steps of 0.5. We have assumed complete 
training sets with the same number density of training galaxies per colour volume for every case shown here. We can see that the bright 
galaxies with high signal to noise have well constrained photometric redshifts whereas the noisy galaxies have almost unconstrained 
photometric redshifts. The plots are colour coded and the scale is exponential; a colour difference corresponding to one is equivalent to 
the density being decreased by a factor of e. 



Sigma versus redshift. 
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Figure 10. The rms scatter of (zphot — Zspec) as a function of 
redshift for samples with different magnitude cuts. Same data as 
in Fig|9] As we can see brighter samples have less galaxies but also 
have much better photometric redshifts with many less outliers 
and smaller scatter. 



where the sum over i is a sum over all the network inputs. 

In order to obtain the quantity dz/drrii we can use an 
algorithm using the activ ation function for the weights of 
the network described in ([Bishopl 1 19951 ). This algorithm is 
fully incorporated in the ANNz package (|Collister Lahavl 
12004') which we use here. 

We have plotted in Fig|5]the IR catalogues cleaned con- 
servatively with a photometric error estimate of 0.3. We have 
chosen this threshold as it removes a vast majority of the 
outliers in the shallower catalogue (DES + IR). In this case 
we can see that around 30% of the sample has been removed 



from the data however the quality of the photometric red- 
shifts is almost free of catastrophic outliers. By retaining 
this conservative cut, the fraction of galaxies that are not 
removed increases according to the depth of the optical plus 
IR survey. For the ideal + u + IR case only 5% of galaxies 
are cut and there are no outliers. 

We also plot in Fig|6]a scatter diagram for 20000 galax- 
ies representing the errors estimated by the neural network 
as a function of the difference between the photometric and 
spectroscopic redshifts. This shows us the reliability of the 
error estimate. We can see that the error estimate is rela- 
tively good if there is IR data included whereas it is not opti- 
mal with optical data only; the points with low photometric 
redshift errors are concentrated around Zspec — Zphot for 
optical plus IR data but not if we have optical data only. 
In fact when IR data is included the error estimate overesti- 
mates the real error in the sample. We find that for optical 
data only, the deeper surveys can be cleaned by removing 
the higher error galaxies, however this becomes harder to 
do with the shallower surveys. This is mainly because the 
magnitude cuts for the sample have been done here at RIZ 
of 25.0 and the shallower surveys have large scatters; the 
large scatters are dominated by the fainter sources and this 
biases the error estimation. 

In order to assess whether bands are necessary to ob- 
tain good photometric redshift error estimates with neu- 
ral networks and be able to clean the catalogues efficiently 
we have plotted in Fig|8] the unnormalised histograms of 
the quantity {zspec — Zphot)/ Zerr- Each galaxy in the mock 
will have this ratio; if the error estimate is reliable then the 
shape of the histogram for this quantity should be centred 
around zero and there should be roughly 68% of galaxies 
with I Zspec — Zphot\/ Zerr < 1- Wc cau scc iu Fig |8] that the 
shallower optical survey is not able to produce data good 
enough to produce reUable photometric redshift errors at a 



Photo-z from the ground and weak lensing from space 13 



RIZ magnitude cut of 25.0. In fact it is the range of wave- 
lengths probed that produce a more rehable photometric 
redshift error estimate; i.e. by including IR data the im- 
provement is great, however by including u data or z and y 
data it is also possible to obtain good photometric redshift 
errors and clean the catalogues as shown previously. 

We show the effect of the cleaning in Fig|7] where we 
compare Zspec versus Zphot scatter plots, for a range of 
mocks, before and after the cleaning procedure. We con- 
clude that IR or u band data is very important to obtain 
good photo-z error estimators with neural networks, when 
selecting galaxies fainter than RIZ=25.0 mag. 

3.4 Impact of the source catalogue depth 

In catalogue generation we should always define a certain 
cut where we define what an object is. In this paper we have 
taken a magnitude cut in a given band to define our cata- 
logues. Here we will show how this cut influences the quality 
of the photometric redshifts and the photometric redshift 
errors that are associated to each catalogue. The larger the 
photometric signal to noise ratio, the better any code will 
be able to recover a photometric redshift for that galaxy. 
Therefore we can obtain very deep photometry and produce 
a catalogue of brighter objects which will have better photo- 
metric redshifts and associated photometric redshift errors 
or have a catalogue of fainter sources where the photometric 
redshifts and their associated errors are less reliable. 

We plot in Fig|9]the results where we have used a sim- 
ulated survey similar to the COSMOS survey as described 
in Tab HI We have used in each subplot only a subset of the 
entire set of galaxies we have simulated. The cut was done 
on the F814W filter, with depths of 24.5, 25, 25.5, 26 and 
26.5. As we can see the bright galaxies have accurate pho- 
tometric redshifts, much more accurate than the faint ones. 
Particularly, many catastrophic outliers disappear with the 
shallow integrations. This is because some bands are able 
to distinguish the real redshift of the galaxy and the catas- 
trophic error, for instance as we have already mentioned the 
u band can help remove low redshift galaxies which are as- 
signed a high photometric redshift. However, a high signal 
to noise is needed which is available for the bright galaxies 
and not for the faint ones. 

We have taken the data used in Fig|9] and plotted the 
scatter as a function of redshift in Fig jlQI As we can see 
there is a definite trend of having a lower scatter if the sam- 
ple taken has a high signal to noise ratio for the magnitude 
estimates. We can follow from the lower curve to the curve 
situated on the top how increasing the depth of a catalogue 
with the same data may produce more galaxies with photo- 
metric redshifts of worse quality. 



3.5 Impact of the training set 

We have assumed so far that the training set used to train 
the neural networks is totally representative of the testing 
set we use to produce the 2D probability densities and scat- 
ter plots to assess the photometric redshift accuracies. For 
most cases this requires a training set which is complete 
down to a magnitude limit of 25. Observationally this is a 
hard task as spectrographs have limited spectral ranges and 



the features required for redshift estimation change in ob- 
served wavelength. It should be easier to produce such a 
trainin g set down to 24 or 2 4.5 and in the redshift range 
to 1.4 (|Le Fevre et al.ll2QQ5l ). 

However the faint end and high redshift range might 
pose some problems. One efficient way to get redshift es- 
tima tes in the range 1 . 4 to 3 is in the blue optical and 
U V (jSteidel et all Il999l : iLillv et al.1 l2006l ) There are many 
metal absorption features in this range and Lyman- Alpha 
comes into the optical window at z = 1.8. This is what cur- 
rently high-z surveys are doing successfully. Another option 
is to have a large spectroscopic redshift survey in the IR 
which currently doe s not exist but will be done with FMOS 
(jPalton et al.H2006l ) in the near future. It should be noted 
that this is an optimistic scenario as the training set may 
not be as wealthy as we have estimated here. 

We also have neglected here the contamination of other 
unusual objects that might be introduced in the sample. For 
instance, low- luminosity Seyfert-1 galaxies will have strong 
lines and a different continuum shape than the usual galax- 
ies considered. These could, in reality, account for roughly a 
XX per cent of extra objects in the training and testing sets. 
We note however that a neural network is a Bayesian object 
therefore, if the colours of these unusual objects are totally 
different from the colours of the other galaxies then the net- 
work is flexible enough not to interfere on the training of 
the usual galaxies and the same result will be found. If the 
unusual objects have similar colours to the usual galaxies 
then the fact that the network is Bayesian will be to our 
advantage. Given that the contamination is only small, the 
weight of these unusual objects will be diluted as they are 
less representative. We therefore argue that a small contam- 
ination of unusual objects whichever their colour will not 
affect the photo-z of usual galaxies considerably. 

We stress here that the errors which will be associated 
to training sets arise from two terms, one is the square root 
of the number of spectra Ns available in our analysis. The 
second is the rms sigma as a function of redshift for that 
group of galaxies. Weak gravitational lensing is sensitive to 
the error on the mean of the redshift for galaxies, which is 
dependent on the quantity a'^{z)/Ns. This error is complex 
to analyse as a{z) depends on the photometric bands as well 
as the method used for photometric redshift estimation. An 
analysis of this is made in Sec|5] 

To assess the impact of an incomplete training on the 
accuracy of the photometric redshifts produced we have used 
training sets with a brighter magnitude cut and estimated 
photometric redshifts in a fainter sample. We have main- 
tained the density of training galaxies per unit of colour 
volume the same in all the runs. We have chosen a cut of 
25.5 in the F814W band and trained the neural networks 
with sets cut at 25.5, 25 and 24.5 in the same band. The 
probability density plots are shown in Fig jlll and the scatter 
as a function of redshift in Fig jl21 As we can see an incom- 
plete training produces slightly worse photometric redshifts. 
If we assume that we can extrapolate from the colours of the 
brighter objects, having only a training set complete to 0.5 
magnitudes brighter degrades the photometric redshifts by 
about 20%. There are however, as shown in Fig jlli a much 
larger number of catastrophic outliers which are mainly the 
galaxies with no representatives in the training set. 
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Figure 11. Effect of the training set depths in the photometric redshift quahty. We have taken mocks from the survey denominated 
by cosmos and we have made cuts in the filter F814W at several magnitude cuts from 24.5 to 26.5 in steps of 0.5 in magnitude. We 
have assumed complete training sets with the same number density of training galaxies per colour volume for every case shown here. We 
can see the bright galaxies with high signal to noise have well constrained photometric redshifts whereas the noisy galaxies have almost 
unconstrained photometric redshifts. The plots are colour coded and the scale is exponential; each unit corresponds to one e-fold. 



Sigma versus redshift. 
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for samples with different training sets. Same data as in Fig jlll As 
we can see the sample which has been trained with an incomplete 
training set has worse photometric redshifts with many more out- 
liers and a larger scatter, however the decrement in accuracy is 
not extremely large. 



4 COLOUR & TYPE ANALYSIS 

In this section we attempt to analyse and establish which 
types of galaxies are producing large catastrophic outliers in 
the photometric redshift analysis. This is important to assess 
which galaxy properties are introducing the larger errors in 
our analysis, this would allow us to have a greater under- 
standing of how to reduce systematic effects due to photo- 
metric redshifts and have a comprehensive understanding of 
how different bands can help the photometric redshift anal- 
ysis. 

For this purpose we choose two different catalogues to 
perform our colour and type analysis. We choose first a cata- 



logue obtained from DES like exposure times that has been 
cut in magnitude at r < 24.0, which is the depth of the 
photometry in the optical. We also choose another optical 
catalogue with an RIZ depth of 25.0 and with the specifica- 
tions of the ideal optical survey we have chosen in Sec l3.2l 

We show in the centre of Fig jl3l the scatter plot for 
the photometric redshifts as a function of the spectroscopic 
redshifts for the fourth case considered. From this figure 
we have selected four spectroscopic redshift bins (0.0-0.35, 
0.35-0.6, 0.65-0.9 and 1.6-2.3) which contain large numbers 
of ouliers. From these regions we selected all points with 



\Zphot 



> 0.3 on one of the sides of the z. 



phot 



We plot on the four panels above and below the scatter 
diagram the relative histograms of the populations within 
the regions selected relative to the average population of 
galaxies considered in the magnitude cut. This means that 
if the histograms are above one this galaxy type or Ay (the 
extinction in magnitudes in V band) is dominant in the re- 
gion selected, whereas if the histogram is below one then the 
population in the region is sub-dominant. 

We can see that in the first region, most galaxies that 
are scattered towards higher photometric redshifts have a 
high Av in our photometric catalogue. This means that 
galaxies which are not reddened have relatively good photo- 
metric redshifts whereas galaxies which are heavily reddened 
are scattered up towards high redshift. We can see from the 
second region chosen that the same occurs there, however 
this happens for different types of galaxies. The same oc- 
curs for the fourth region chosen, above a redshift of 1.6, but 
here the galaxies which are severely reddened are scattered 
towards lower redshifts. The third region we have selected is 
heavily populated by galaxies with types close to 50 which 
corresponds in our notation to very blue and young star- 
burst galaxies which are naturally hard to get photometric 
redshifts for because of their relatively featureless contin- 
uum. We conclude from these graphs that the main source 
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Figure 13. Mock data with the survey we have labelled by ideal optical survey without u band photometry. We find that despite the 
difference in the magnitude limit of the survey the regions with a high concentration of outliers contain similar types of galaxies which 
are mainly reddened galaxies and/or starburst galaxies. The main difference is that in the region 1.6 to 2.3 in redshift the outliers have 
a low fraction of elliptical galaxies. This is due to the high exposure times in the y and z bands chosen in this mock ideal survey. 
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Figure 14. Colour-Colour (g-i x r-z) diagram of the different 
galaxy types with redshifts ranging from to 2. The red points 
represent elliptical galaxies. The bluest point correspond to the 
colours of an elliptical galaxy at 2; = 0, the other points are the 
same galaxy further along in redshift up to z = 2.0. Pink, dark 
blue and green, light blue points represent Sbc, Sdc and Irr types 
of galaxies. We have also plotted the vector in this plane which 
corresponds to the shift in colour for a reddening of Ay = 1.0. 
As we can see the shift in the colour-colour plane corresponding 
to redshifting the galaxy is degenerate in some cases having a 
reddened galaxy. In other words a faint reddened galaxy at low 
redshift has very similar colours to a brighter galaxy at higher 
redshift with no reddening. Similarly a very high redshift galaxy 
which is reddened has the same colours to an intermediate redshift 
galaxy which is not reddened. 



for catastrophic photometric redshifts for an optical survey 
arises from heavily reddened galaxies as well as very blue 
starburst galaxies. 

We have performed the same analysis with similar re- 
gions for r < 24. The results are very similar to the results 
obtained for a cut one magnitude fainter. The main dif- 
ferences are that in region four with redshifts larger than 
1.6 the late type galaxies with type close to zero are well 
constrained by the optical data given the very deep expo- 
sure times in the z and y bands. Furthermore the scatter 
in that region is significantly lower. However, the less well 
constrained objects have still same properties of being either 
blue starburst or heavily reddened objects. 

There is a big degeneracy between redshift and redden- 
ing for some redshift ranges. We plot for different galaxy 
types the colour in the r-z and g-i plane for galaxies at dif- 
ferent redshifts. We also plot the vector which corresponds 
to the average shift in colour for a reddening of Ay = 1.0. 
Clearly galaxies at low redshift which are reddened have very 
similar colours to galaxies at higher redshifts which are not 
reddened. We can see with this explanation why galaxies are 



scattered upwards in the Zs 



Zphot 



plane. We have used a 



neural network to obtain photometric redshifts; given that 
the training set we have chosen is representative there will 



be more galaxies at redshift z ^ 1 than at low redshift. We 
argue that the inclusion of IR data helps the distinction be- 
tween reddened galaxies and galaxies with a low Av given 
the different extinction as a function of wavelength which is 
low at IR bands. The case which we have chosen with an 
ideal + u survey can also distinguish between galaxies with 
a high and low Ay; this is because at very high exposures 
for the u band, a detection in u would allow us to know 
whether the galaxy is reddened. This is because of the high 
extinction in the u band. 



5 WEAK LENSING TOMOGRAPHY: THE 
DEPENDENCE OF THE DARK ENERGY 
PARAMETER ESTIMATION ON THE 
PHOTO-Z ACCURACY 

5.1 A qualitative treatment of the WL-photoz 
cross talk 

One important application of photometric redshifts is for the 
analysis of weak lensing tomography (Hu 1999) , as e.g. in 
the DUNE experiment. The idea is to slice a lensing survey 
into photometric redshift bins and to analyse the cosmic 
shear for high si^nal-to-n oise galaxy image s in each photo-z 
bin, e. g. as dis cussed by Ma et al. (2006), hereafter MHH 
and bv lAmara Refregier (2006), hereafter AR. 

Specifically, we can divide the galaxy sample into photo- 
z bins and examine the effect on the derived cosmological 
parameters due to uncertainties in the photo-zs. Consider a 
distribution of sources selected from a photometric redshift 
bin which results in a more complicated (not necessarily 
Gaussian) distribution with respect to true (spectroscopic) 
redshift, with mean redshift z and variance: 



/i2 



(3) 



Assuming Poisson statistics we can predict the variance 
in the mean redshift z given Nspec spectroscopic redshifts 
associated with that photo-z bin: 



i6z) 



rms^{z) 



■ 1^2 /Ns 



(4) 



We can now model crudely the the uncertainty in de- 
riving the constant Dark Energy parameter w — P/ p from 
WL if the only uncertainty is due to photo-z errors: 



\{5w)/w\ — a{5z)/z 



(5) 



The 'fudge factor' a can be estimated from detailed 
modelling of the WL power spectrum. For example, if we 
set all other p arameters to b e kno wn in the scaling re- 
lation given in Huterer et al. 1 (|2006l . Eq.24) we find that 
z^'^ oc and hence a — 5.2. This value a 5 can 

also be justified qualitatively by examining the sensitivity 
of cosmological distance and the linear growth to variations 
in w (Peacock & Schneider 2006). 

We can now combine the last two equations to give: 



\{5w)/w\ = {a/z)^H2/Ns 



(6) 



For example, for a desired fractional error of 1% on 
a = 5 , ^ = 1 and 112 — 0.06 (derived from our mocks and 
ANNz averaged over a range of proposed optical and IR 
surveys) we find that Ns ~ 15, 000 spectroscopic redshifts 
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are required for that bin, or for say 10 bins a total of 150, 000 
spectroscopic redshifts. 

Our back-of-the-envelope calculation helps us to un- 
derstand the link between the Dark Energy parameters, 
the photometric redshift performance and the number of 
spectroscopic redshifts. However, being derived only for a 
constant w and with other cosmological parameters fixed, 
it somewhat under-predicts our detailed calculation below 
which is done for a 2-parameter equation of state after 
marginalisation ove r other parameters. 

More generally Amara & Refregier ("2006') have consid- 
ered the way the Figure of Merit (FOM) for the Dark Energy 
parameters {wo,Wa) is affected by photo-z errors alongside 
the effect of the sky coverage and depth, the shear measure- 
ments systematic and uncertainties in the non-linear power 
spectrum predictions. Their scaling relations roughly agree 
with our detailed FOM calculations described below. 





+ RIZ 


+ RIZ + IR 


DES-like 


72/83.6 


120/124.0 


Pan-4-like 


80/87.3 


132/130.8 


LSST-like 


116/112.0 


156/148.0 


Ideal-like 


136/130.5 


168/160.4 


Ideal + u-like 


164/151.0 


168/162.3 



Table 4. The FOM for the survey configurations we consider with 
and without IR data. The FOM results are for a lensing survey 
only and do not include any other prior such as CMB. We note 
here that this is not a straight comparison for the different surveys 
as we only consider galaxies which will be observed by DUNE. For 
instance the real DES survey will be only over a smaller area of 
the sky and a LSST survey would be much deeper and hence have 
many more galaxies usable for lensing. The numbers on the left 
are the numbers computed in this work using the full distribution 
of redshifts. The numbers on the right are FOM values using the 
fitting formula given by AR. 



5.2 Dark Energy Figure of Merit (FOM) 
calculation 

We summarise below the ingredients for the FOM calcula- 
tion for Dark Energy parameters from weak lensing. The 
predicted angular power spectrum Cij{l) between redshift 
bins i and j depends on the 3-D matter power spectrum 
P(k — l/r) and on the radial window functions Wi{r) and 
Wj{r) of bins (i, j): 

= / drr-^W^{r)WJ{r)P{l/r',r)+5^J(Jl/n^ , (7) 

^0 

where r is the comoving distance and th is the Universe hori- 
zon. The last term is the 'shot noise' due to (Je, the intrin- 
sic ellipticity noise for the galaxy sample, and n^, the total 
number of galaxies in the radial slice. The case i = j gives 
the auto-correlation of bin i. The window function Wi{r) de- 
pends on cosmological parameters and on the redshift distri- 
bution pi{z) of the source galaxies redshift slice, normalised 
such that fii — J dzpi{z) is the total number of galaxies 
in the slice and the integral is over the slice's radial bound- 
aries. For a comprehensive list of detailed equations in weak- 
lensing correlations we refer the reader to (|Bridle Kind 
I2OO7I I. 

Ideally, we would like to know the Pi{z) exactly from a 
spectroscopic survey, where the redshift is derived from the 
spectrum of each galaxy. In reality, deep wide field surveys, 
such as DUNE, will only provide us with multi-band imag- 
ing data which will allow us to derive photometric redshifts 
based on templates and/or spectroscopic training sets. We 
can relate the probabilities for the true redshift Zspec and 
the photometric redshift Zphot by the Bayesian rule of con- 
ditional probability: 

p{Zspec, Zphot) = p{Zspec\Zphot )p{Zphot) — p{Zphot\Zspec)p{Zspec 

Consider now a sharp cut in a photo-z bin i , i.e. we 
select only those galaxies in the range Zphot(i) < ZpUot < 
Zphot{i-\-'^) ■ We can write the probability for the true redshift 
distribution resulting from the photo-z slice as: 

Pi{Zspec) — I p{Zphotj Zspec ) dZphot (9) 



Typically pi{zspec) would have a wide spread, not in the 
form of a Gaussian, as a-symmetric tails are present due the 
photo-z catastrophic errors. 

We have now two options. One is to parameterise 
Pi{zspec) directly based on the projection of the photo-z 
scatter diagram pi{zspec, Zphot), derived from a spectroscopic 
training set or mock catalogues. The second option is to 
model it using eq. JS]): 

Pi^Zspec) — I p{Zspec)p{Zphot\Zspec ) dzphot, (10) 

where p{zspec) is the overall galaxy redshift distribution and 
p{zphot\zspec) can be modelled, somewhat ad- hoc, as a Gaus- 
sian (e.g. MHH, AR). 

Here we prefer the first option, i.e. we use the actual 
distributions of spectroscopic redshifts coming out of the 
simulations for the analysis. 

The uncertainties in the shapes of the probability dis- 
tribution function also have to be taken into account, due to 
the finite number, Ns, of the spectroscopic redshifts per bin 
in the training set. We have assumed an uncertainty on the 
mean and on the variance of the distributions we have from 
the photo-z simulations for each redshift shell in our analy- 
sis. As explained in the previous sub-section, from Poisson 
statistic we can predict the variance in the mean redshift of 
the bin: 

rmS^{z) = fi2{Zspec)/Ns (11) 

where fik{zspec) — {{zspec — z)^) for a given photometric 
redshift bin. Similarly for the variance in the variance 

rms'{n2) = i^[(iV,-l)M4-(iV.-3)Mi] ^ ^^^-^ 

One can marginalise over both these uncertainties. The 
dependence on the number of spectroscopic redshifts is really 
an indirect expression of the scatter in z and in /J2- 

The Dark Energy equation of state is commonly written 
as: w{a) = wq -\- {1 — a)wa- One can also define a^, the 
pivot value at which the uncertainty in w{a) is the minimum. 
Accordingly we define Wp — wo + (1 — ap)wa. Armed with 
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this parameterisation we can now apply the Fisher matrix 
formahsm. We define as: 



FOM : 



(det^-^)V2 



6Wr)6Wa 



(13) 



where i and j denote the elements of the covariance ma- 
trix {F~'^) that contain the equation of state parameters 
{wp,Wa)) and Swp and Swp are the 68% errors on Wp and 

Wa. 

In our Fisher matrix analysis we vary the following cos- 
mological parameters: the dark matter content Qrn, the dark 
energy parameters wq, Wa, the Hubble constant divided by 
100 the amplitude of fluctuations at 2; = as, the bary- 
onic matter content Qb and the scalar index of the primordial 
spectrum n where the fiducial values are 0.3, -0.95, 0.0, 0.7, 
0.8, 0.045 and 1.0 respectively. We assume spacial flatness. 
As we have already stated we have also assumed that there 
is an uncertainty over the mean and variance for each of 
the redshift bins independently. Hence we have two extra 
nuisance parameters that we will marginalise over for each 
photometric redshift bin. 

When performing lensing measurements, cuts are made 
on the detected galaxies. The two most significant cuts are 
in flux, where galaxies below a certain magnitude threshold 
are not used and a size cut where small galaxies are rejected 
because their shape is hard to measure. In this work we 
have not included a size cut since this would require detailed 
image simulations which is beyond the scope of this paper. 
However the effect of this cut is important since this will 
remove high redshift galaxies from our sample. To mimic the 
effect of these cuts we randomly sample the galaxies in our 
mock and apply a selection so that our galaxy population has 
a more realistic PDF. Specifically we impose a distribution 
(P(z) oc exp( — (z/^^o)"^"^)), where zq is set by the median 
redshift of the galaxies (zq = Zrn /I -412), which we assume 
to be Zm — 0.9. 

We have performed all the analysis with five photomet- 
ric redshift bins and also checked that having more pho- 
tometric redshift bins did not improve considerably on the 
FOM. We have chosen the photometric redshift bins so that 
the number of galaxies in each bin was the same. The total 
number of galaxies considered was 35 gal/arcmin^ and we 
have considered a survey covering 20000 deg^. 



5.3 FOM results 

We now investigate the role of the IR filters in the FOM 
prediction. We can clearly see that the role of the IR filter 
depends on the quality of the optical data available. We 
found before that the scatter plots for the photo-z benefited 
significantly by the inclusion of IR data. Here we find that if 
we have poor optical data with a depth similar or shallower 
than the target depth of 25 in the RIZ filter, then the IR 
bands can improve the FOM by a factor of 1.7. However if 
the optical data available is deeper than the target depth 
of 25 in the RIZ filter, especially in the z and y bands then 
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Figure 15. The FOM as a function of the number of spectro- 
scopic redshifts (per bin) used to calibrate the photometric re- 
sults. We can see that the FOM began to level off at 10^ and 
having more than 10^ spectroscopic redshifts does not increase 
the FOM thus meaning that the photo-z are well enough cali- 
brated. We conclude that for a DUNE-like survey we will need 
around 10^ objects in each redshift bin (here we assumed 5 bins) 
to calibrate the photometric redshift. Around 10^ objects might 
be sufficient, however the use of Eqnjllldoes not take account of 
outliers or the non-Gaussianity of the distribution and a number 
closer to 10^ will be necessary. Furthermore these redshifts must 
be representative of the sample and therefore most of them should 
be of galaxies in the faint end of the catalogue, i.e. fainter than 
RIZ of 24. 

this FOM improvement|3 is reduced to 1.3 and if very deep 
u band data are available then the improvement is minimal 
for the purposes of Dark Energy determination. 

All the numbers above assume an infinite number of 
spectroscopic redshifts are available to calibrate the photo- 
metric data. In practise only a finite number is necessary 
in order to calibrate the data well enough so that there is 
no degradation of the FOM. In order to asses this we have 
performed Fisher matrix calculations where we have prior 
knowledge given by Ns, the number of spectroscopic galax- 
ies. We show the results in Fig jl51 As we can see if we have 
around 10^ galaxies in each of the photometric redshift bins 
that we have assumed we have very little degradation of the 
FOM. We therefore conclude that we will require that many 
spectroscopic redshifts to calibrate the photometric sample 
well enough. Obtaining this many spectra is not a daunting 
process. However, we emphasise that most of these spectro- 
scopic redshifts should be of galaxies in the faint end of the 
sample which will be the most numerous. A sample of 10^ 
galaxies at high brightness would not be suitable. As we dis- 
cussed in Sec j3.5l this spectroscopic sample is currently not 



^ The Dark Energy Task Force report (|Albrecht et al.ll2006h de- 
fined The Figure of Merit slightly differently as the reciprocal 
area in the wq — Wa plane that encloses 95% CL region. It differs 
to our FOM by a constant factor. 



^ In comparison the DEFT report (|Albrecht et al.|[2006l ) suggests 
that a new generation of surveys should improve the FOM by a 
factor of 3. Here the improvements with IR are a factor of 1.3 to 
1.7 within the same survey generation. 
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Figure 16. The fraction of galaxies removed from each of the 
surveys considered if we apply a cut in the photometric redshift 
error. As we can see the inclusion of the IR bands increases sig- 
nificantly the accuracy of the error determination in our analysis. 
We also note that among the mock sets with simulated optical 
data only, the one with u band data is also reliable in obtaining a 
reliable error estimate. Furthermore as it is argued it is possible 
to clean photo-z catalogues removing outliers without decreasing 
significantly the FOM for Dark Energy parameters. 



Figure 17. The FOM as a function of the photometric redshift 
error cut. It can be noted that if we have a survey with deep IR 
bands or a survey with deep u band photometry then performing 
a cut to remove outliers does not decrease the FOM significantly. 
This is clearly a function of how deep the optical survey is. We 
can see in Fig|5]the quantity of outliers left on the mocks with a 
cut of 0.3. 



available, however, with IR surveys probing galaxies at a 
redshift of 2 combined with surveys such as zCOSMOS the 
prospects of having such a sample in the next decade is not 
unfeasible. However, there is need for more detailed study 
to assess whether this sample will be adequate in terms of 
completeness to be able to calibrate photometric redshifts. 

We have listed in Tab |4] the results for each survey and 
compared them with results given by the fitting formula 
from AR. We have used the values of sigma 68 as the val- 
ues for the scatter for this distribution and we defined the 
fraction of outliers as the galaxies which were situated Saes 
or more away from the mean photometric redshift of that 
bin. We can see from the results of the table that the full 
calculations assuming the full PDF give very similar results 
to the fitting formula from AR. 

5.4 Other considerations: systematic effects. 

We have assessed the impact on the statistical errors that 
different photometric redshift distributions would have on 
constraining the Dark Energy parameters. In a weak lensing 
survey, however, there will be further systematic barriers 
which would not allow this statistical limit to be reached. 

One of the important effects that will have to be re- 
moved or modelled and fitted for is the effect of intrinsic 
alignments for close-by galaxies. The intrinsic-intrinsic (II) 
power spectrum introduced by galaxy intrinsic alignments 
can be written as 

C!,{1) = / drr-'Mr)pj{r)Pii{l/r;r), (14) 

where pi is the redshift distribution for the galaxies in bin i 
and Pi I is the intrinsic alignment power spectrum. We can 
see that if pi and pj are independent the contamination of 
the II term is minimal. Therefore weak lensing surveys would 
require a low overlap of galaxies between different bins. Sev- 



eral ideas have been propose d recently to imp rove the per- 
formance of the WL analysis. Ijain et al ] (|2006h suggested a 
'colour tomography' to bin the galaxy data in colour space 
where galaxies have small overlap in the pi, rather than to 
generate a photo-z catalogue and then bin it. 

It is beyond the scope of this paper to do a complete 
analysis of how these systematic effects will hinder the FOM 
for the Dark Energy parameters. We refer the reader to 
(iBridle k Kindl2007r ) for a more detailed analysis of this. We 
have assessed to what extent a detailed analysis of photo- 
metric redshift errors would help decrease the contamination 
of the II term in weak lensing. We have run Fisher matrices 
for catalogues which were cut in increasing redshift error. 
That is to say that galaxies that are found to have high er- 
ror estimates are removed from the sample, this can be seen 
in Fig jl6l We plot the fraction of galaxies left in this sample. 
Consequentially, there are less systematic errors as there are 
fewer outliers in the sample. We can see from Fig |5] how this 
affects the number of outliers in the photo-z analysis for a 
given cut in the estimated error. 

We plot in FiglTTlhow the FOM is decreased by a photo- 
z error cut for some of the surveys considered. We can see 
clearly that some surveys can have a large cut in the pho- 
tometric redshift error and their FOM remains almost un- 
changed. This clearly means that we will be removing sys- 
tematic errors due to the overlap of the photometric redshift 
bins but not hitting the FOM by removing galaxies which 
introduce relevant information in the determination of Dark 
Energy. Furthermore, if there is a need to model a galaxy- 
intrinsic (GI) alignment contribution to the cosmic shear 
signal, this will be more reliable if we obtain a sample with 
only the reliable photometric redshifts and which does not 
decrease the FOM significantly compared to the full sample. 
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6 CONCLUSIONS 

In this work we have looked at the role of optical and near- 
IR photometry in the context of weak lensing tomography. 
In particular we have quantified how the Figure of Merit 
for Dark Energy parameters is affected by the choice of fil- 
ters and observing conditions. We have generated catalogues 
from a range of proposed surveys. For fixed mock simula- 
tions and a fixed photo-z method (ANNz) we explored the 
photo-z accuracy and systematics by varying the set of fil- 
ters and the magnitude limit of the surveys. The aim of this 
was to compare the role of bands and to reduce biases given 
that different mocks and photo-z methods may give different 
answers. 

From the photometry, we find that there is an interplay 
between the choice of filters (in particular the J, H and the 
u), the depth (i.e. magnitude limit), and the removal of out- 
liers based on the photo-z errors. We find that if we are to get 
IR data from space the improvements are greatly dependent 
on the quality of the ground data available. For surveys that 
go down to the same magnitude limit as the lensing survey 
in the RIZ band the improvements that the J and H bands 
bring are great. However, if we have ground based photome- 
try, particularly in the z and y bands, the improvements due 
to IR data becomes smaller. There is a trade-off between the 
u and the IR, meaning that the u band, provided it is deep 
enough, can play a similar role as the IR data. 

In summary, our main conclusions are: 

• The addition of J+H to griz+RIZ dramatically reduces 
the scatter in individual photo-z, in particular for the shal- 
low griz surveys. 

• The u band filter is effective in removing outliers and 
can play the same role as the IR filters but only if the RIZ 
depth is significantly larger than the depth of the lensing 
survey chosen. 

• The results presented here depend on galaxy formation 
scenarios which are encoded into the mock catalogues. The 
main source for catastrophic photometric redshifts for an 
optical survey arises from heavily reddened galaxies as well 
as very blue starburst galaxies. It is hard to distinguish be- 
tween a higher redshift galaxy and a lower redshift reddened 
galaxy with optical colours only below z ^ 1.2. The opposite 
is true above z ^ 1.7. The inclusion of u band data or IR 
data breaks this degeneracy. 

• Our derived Figure of Merit FOM = \bw^8waY^ ^ 160 
can be obtained with a realistic mock catalogue solely with 
weak lensing data. This would be a significant increase over 
current estimates and also the next generation of surveys. 
For comparison a current estimate of the error on w is 
[6wp]-^ - 10. 

• Given an ambitious ground based survey such as the 
LSST + IR bands from space, only a marginal improvement 
in FOM can be achieved by increasing the photometric ac- 
curacy of the visible bands (i.e. the inclusion of deep u band 
photometry and deeper z and y bands only increases the 
FOM fractionally from 156 to 168 (see Table |4|). Since in- 
creasing the accuracy of the visible bands will likely require 
deep space lensed data, the extra cost of this is difficult to 
justify. This means that LSST is the ideal counterpart to a 
DUNE like survey. However deeper spaced based missions 
might have other ideal ground based photometric matches. 

• The FOM improvement obtained from the addition of 



IR data depends on the quality of the ground based optical 
data. The FOM is increased by a factor ranging from 1.7 
down to 1.3 for realistic mock surveys. 

• The required number of spectroscopic redshifts needed 
depends on the number of galaxies to train a neural network 
and also the quantity of galaxies needed to calibrate the 
photometric redshifts. We argue that a value around 10^ in 
each redshift bin will be necessary for weak lensing studies 
from future space based missions. 

• The 'cleaning' of outliers is effective. There is a trade- 
off between reducing the photo-z error by removing galaxies, 
but increasing the shot noise. It is possible to clean a photo- 
z catalogue without decreasing the FOM significantly. We 
conclude that this is an effective way to decrease systematic 
effects from a weak lensing survey and is an alternative to 
colour tomography. 

The general conclusion from our study is that combin- 
ing weak lensing measurements from space and photometric 
redshifts from optical ground-based data is indeed a very at- 
tractive way to constrain Dark Energy properties. Further- 
more, the use of IR from space can significantly improve the 
accuracy of Dark Energy measurements by 30-70 percent. 
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